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The crystal structure of cyclic AMP-catabolite activator protein (CAP) from 
Escherichia coli containing cobalt(II) chloride and ammonium sulfate is 
reported at 1.97 A resolution. Each of the two CAP subunits in the asymmetric 
unit binds one cobalt(II) ion, in each case coordinated by N-terminal domain 
residues Hisl9, His21 and Glu96 plus an additional acidic residue contributed 
via a crystal contact. The three identified N-terminal domain cobalt-binding 
residues are part of a region of CAP that is important for transcription 
activation at class II CAP-dependent promoters. Sulfate anions mediate 
additional crystal lattice contacts and occupy sites corresponding to DNA 
backbone phosphate positions in CAP-DNA complex structures. 




1. Introduction 

CAP (catabolite activator protein) is a DNA-binding protein and 
transcription activator that recruits RNA polymerase to more than 
100 genes in Escherichia coli (Busby & Ebright, 1999; Lawson et al., 
2004; Zheng et al, 2004). The CAP dimer consists of two identical 
209-residue subunits, each with an N-terminal binding domain for 
allosteric effector cAMP, and a C-terminal winged helix-turn-helix 
DNA-binding motif. CAP has three defined 'activating regions', 
patches of surface-exposed residues that interact with RNA poly- 
merase (RNAP) in specific CAP-dependent promoter classes. Acti- 
vating region 1 (ARl), located on the C-terminal domain, attracts a 
flexibly tethered RNAP a subunit C-terminal domain to bind to a 
DNA site adjacent to CAP (Benoff et al, 2002; Lawson et al, 2004; 
Hudson et al., 2009). AR2, located on the N-terminal cAMP-binding 
domain, functions specifically at class II promoters and is predicted to 
make direct contact with an acidic loop of the RNAP subunit N- 
terminal domain (ofNTD^; Niu et al., 1996; Busby & Ebright, 1999; 
Lawson et al, 2004). AR3 is also located on the N-terminal domain 
and is predicted to make direct contact with RNAP a subunit region 4 
(Rhodius & Busby 2000). 

Three distinct crystal forms of native cAMP-bound CAP from E. 
coli have been reported previously: (i) space group P2i2i2i with one 
dimer in the asymmetric unit (PDB entry lg6n; McKay et al., 1982; 
Passner et al., 2000), (ii) space group P2i with one dimer in the 
asymmetric unit (PDB entry 4hzf; Rodgers et al., 2013) and (iii) space 
group PI with two dimers in the asymmetric unit (PDB entry 2gzw; 
RIKEN Structural Genomics/Proteomics Initiative, unpublished 
work). We describe here the structure of cAMP-CAP in a novel 
fourth crystal form (space group P2i2i2 with one dimer in the 
asymmetric unit) in which cobalt(II) binds to key residues of CAP 
AR2. In addition, discrete sulfate-anion-binding sites are found near 
AR2, near the cAMP-binding site and at positions analogous to DNA 
phosphate positions in CAP-DNA complexes. This crystal form of 
cAMP-CAP was fortuitously obtained during efforts to structurally 
characterize a 2:1 CAP dimer-DNA complex under conditions that 
apparently disrupted the protein-DNA interaction (Rao, 2011). 
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2. Materials and methods 

2.1. Preparation of CAP 

Full-length CAP (209 residues) was expressed and purified by 
cAMP-affinity chromatography, as described in Zhang et al. (1991), 
and further purified on a Heparin 16/10 FF column (GE Healthcare). 
CAP was then concentrated and buffer-exchanged to approximately 
360 [iM dimer (approximately 18 mg ml~^) in 20 mM Tris pH 8.0, 
0.1 mM EDTA, 1 mM DTT, 200 mM NaCl, 0.02%(Wv) NaNg. The 
protein concentration was determined using a Bradford assay. CAP 
purity was estimated to be >95%. 

2.2. Crystallization and data collection 

A 71 bp DNA duplex containing two tandem CAP sites separated 
by 32 base pairs was designed according to a promoter sequence 
described by Tebbutt et al. (2002) and was assembled using three 
oligonucleotides (Rao, 2011, p. 11). DNA oligonucleotides were 
obtained from IDT and were annealed to form duplex DNA using a 
standard laboratory procedure (Locasale et al, 2009). The CAP- 
DNA complex used in crystallization trials consisted of 0.1 mM CAP 
dimer, 0.05 mM DNA duplex, 0.4 mM cychc AMP. 

Crystallization screening was by hanging-drop vapor diffusion 
using Hampton Research crystal screens and pre-greased VDX 
plates. Optimization of Crystal Screen 2 condition No. 25 yielded 
ruby-red-colored crystals that appeared within 3 d at 20° C in 0.01 M 
C0CI2.6H2O, 0.1 M MES monohydrate pH 6.5, 2.5 M ammonium 
sulfate. 

Crystals were mounted on nylon loops and flash-cooled in liquid 
nitrogen after being submerged briefly in a cryoprotectant solution 
comprised of 66.5%{wlw) Paratone-N, 2^.5%{wlw) paraffin oil, 
S%{wlw) glycerol. X-ray diffraction data were collected on beamline 



X8C at the National Synchrotron Light Source at Brookhaven 
National Laboratory to 1.97 A resolution. 

2.3. Structure determination and refinement 

The diffraction data were processed and scaled assuming primitive 
orthorhombic lattice symmetry using the HKL-2000 package 
(Otwinowski & Minor, 1997). Molecular-replacement search models 
were prepared using selected coordinates from PDB entry Izrc 
(Napoli et al., 2006). Molecular-replacement trials were carried out in 
all possible primitive orthorhombic space groups using Phaser 
(McCoy et al., 2007). An unambiguous solution was obtained. The N- 
terminal domain dimer of CAP (residues 1-136) was positioned first, 
yielding a Z-score of 18.7 in space group P22i2i; the next highest Z- 
score was 10.0 in space group P22i2. The two C-terminal domains 
(residues 137-209) were located in subsequent Phaser runs. After the 
molecular replacement, the model coordinates and diffraction data 
hkl Miller indices were transformed to the standard P2i2i2 setting. 

Noncrystallographic twofold symmetry observed in the self-rota- 
tion function and Matthews coefficient analysis (Kantardjieff & 
Rupp, 2003; Matthews, 1968) provided additional support for the 
assignment of one complete CAP dimer in the crystal asymmetric 
unit. Upon inspection of crystal packing and likelihood-weighted 
electron density, it became evident that the crystal structure did not 
contain DNA. 

Crystallographic refinement was initially performed using 
REFMAC5 (Murshudov et al, 2011) and in the final stages using 
P HEN IX (Adams et al, 2010). NoncrystaUographic symmetry 
restraints for CAP N-terminal and C-terminal domains were main- 
tained and separately defined. The model was inspected between 
refinement runs and manually adjusted to fit likelihood- weighted 
electron-density maps using Coot (Emsley & Cowtan, 2004). Solvent 




Figure 1 

cAMP-CAP dimer with bound cobalt(II) and sulfate ligands (CAP subunits, blue/light-blue and green/lime ribbons; cAMP and sulfate, stick representation with atom-type 
colors; Co^"^, purple spheres), {a) Crystal asymmetric unit composed of one cAMP-CAP dimer, two Co^"^ ions and 13 sulfate ligands. {b) Co^^ coordination in subunit A 
(crystal lattice neighbor shown at top in orange), (c) C-terminal domain of subunit B shown with bound sulfates and selected DNA sugar-phosphate backbone fragments 
from a superimposed CAP-DNA structure [PDB entry Izrc (Napoli et al, 2006), translucent grey/pink sticks]. DNA numbering follows Fig. 1 of Lawson et al. (2004). 
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Table 1 

Crystal, data collection and refinement statistics and model information. 
Values in parentheses are for the outer resolution shell. 



Crystal data 

Crystal system, space group 

Unit-cell parameters (A) 

No. of CAP dimers in unit cell Z 

Matthews coefficient Vm (A^ Da"^) 

Solvent content (%) 
Data collection 

Diffraction source 

Wavelength (A) 

Detector 

Temperature (°C) 
Resolution range (A) 
No. of unique reflections 
No. of observed reflections 
Completeness (%) 
Multiplicity 

^merge 

Data-processing software 
Refinement 

Refinement software 
a cutoff 

Resolution range (A) 

No. of reflections used in refinement 

No. of reflections above a cutoff 

Final overall R factor 

Atomic displacement model 

Overall average B factor (A^) 

No. of protein atoms 

R.m.s.d., bond lengths (A) 

R.m.s.d., bond angles (°) 

No. of hgand atoms 

No. of solvent atoms 

Total No. of atoms 

No. of refined parameters 

NCS restraints 

Bulk-solvent model 

Final R^^,^ 

No. of reflections for R^ee 
Final R^^^ 



Orthorhombic, P2i2i2 

a = 104.14, b = 108.45, c = 44.32 

4 

2.50 
50.8 

Beamline X8C, NSLS 
1.1 

ADSC Quantum 4 
-173 

50.0-1.97 (2.03-1.97) 
35645 (3280) 
181462 
97.1 

5.1 (4.2) 
9.5 

0.088 (0.778) 

DENZO, SCALER ACK 

phenix.refine, P HEN IX 

F > 1.340cr(F) 

40.78-1.97 (2.02-1.97) 

33823 

35597 

0.194 

Individual isotropic B factors 

35.5 

3268 

0.010 

1.007 

116 

410 

3794 

15176 

Removed in final refinement cycles 
Flat; 5soi = 58.65 A^ yt^oi = 0.37 e A"' 
0.194 (0.286) 
1774 

0.229 (0.324) 



atoms were initially added manually using conservative criteria; in the 
final refinement stages, the phenix.refine water-picking protocol was 
used with default settings. Cobalt(II) and sulfate ligands were 
manually placed in positions of strong electron density at the protein 
surface that could not be satisfactorily modeled as solvent owing to 
density strength, shape and local stereochemistry criteria. 

2.4. Structure analysis 

The quality and stereochemistry of the model and fit to electron 
density were evaluated using validation tools in Coot. The CCPA suite 
(Winn et al., 2011) programs CONTACT and SUPERPOSE were 
used to evaluate contacts between protein and ligands and to perform 
superposition calculations. Molecular-graphics images were created 
using UCSF Chimera (Pettersen et al, 2004). 

3. Results 

Our efforts to characterize a CAP-DNA complex fortuitously 
yielded a crystal structure for cAMP-CAP without DNA in a novel 
crystal form in which cobalt and sulfate ligands are bound to CAP. 
This structure is shown schematically in Fig. l{a). The crystal asym- 
metric unit contains one CAP dimer (residues 7-207 of chain A and 
7-206 of chain B were modeled out of 210 total residues per subunit), 
with cAMP ligands bound as expected within the N-terminal domain 
(Berman etal, 2005), two cobalt(II) cations, 13 sulfate anions and 420 
solvent atoms. 



The main-chain torsion angles of most residues (96%) lie in 
preferred regions of the Ramachandran plot, with the remainder 
(4%) in the allowed regions. The final R factor is 19.4% and Ri^ee is 
22.9% (Table 1). 

The conformation of the CAP dimer in this new crystal form is 
essentially identical to the conformation observed in earlier structure 
reports for cAMP-CAP (McKay et al, 1982; Passner et al, 2000), with 
two cAMP-binding N-terminal domains related by local twofold 
rotational (C2) symmetry, and flexibly tethered DNA-binding C- 
terminal domains that adopt two distinct positions relative to the N- 
terminal domains. Superposition of all 400 atoms in common with 
the CAP dimer of PDB entry lg6n (Passner et al, 2000) yields a root- 
mean-square deviation (r.m.s.d.) of only 1.4 A. The 'closed' confor- 
mation of subunit A (Fig. la, blue/light-blue ribbon) closely resem- 
bles the conformation of DNA-bound CAP (Schultz et al, 1991; 
Parkinson et al, 1996). In the 'open' conformation of subunit B (Fig. 
la, green/lime ribbon), the C- terminal domain is rotated by 40° and 
translated by 15 A relative to the closed conformation. 

The full A and B subunits superimpose poorly, owing to their 
observed conformational difference (r.m.s.d. of 3.9 A). However, the 
individual N- and C-terminal domains from each subunit readily 
superimpose (r.m.s.d.s of 1.1 and 0.4 A, respectively). The N- and C- 
terminal domains of the new structure also readily superimpose with 
previously determined cAMP-CAP crystal structure domains 
[r.m.s.d.s of <0.9 and <1.5 A, respectively, from PDB entries lg6n 
(Passner et al, 2000), 4hzf (Rodgers et al, 2013) and 2gzw (RIKEN 
Structural Genomics/Proteomics Initiative, unpublished work)]. 

The two Co^^ ions bound to the CAP dimer have chemically 
similar environments on each N-terminal domain, even though the 
positions are not crystallographically equivalent (Figs, la and lb, 
purple spheres). Each has approximate octahedral coordination (Fig. 
lb), with three liganding atoms contributed by N-terminal domain 
residues Hisl9 (N"^), His21 (N"^) and Glu96 (O"^ or O"^). In each 
case, a neighboring CAP dimer related by crystal symmetry contri- 
butes two additional ligands from residue Glu37 (O^^ and O^^). 
Octahedral coordination is completed at each site by a solvent water 
O atom. The average Co^^-to-coordinating-atom bond length is 2.22 
± 0.12 A (12 measurements; range 2.03-2.44 A) and the average 
atom-to-Co^^-to-neighbor-atom bond angle is 92.0 ± 11.0° (21 
measurements; range 57.0-113.1°). 

13 sulfate anions were also identified at positions adjacent to the 
protein surface (Fig. 1; sulfates are shown in stick representation with 
sulfur in yellow and oxygen in red). Six of these mediate protein- 
sulfate-protein contacts between the CAP dimer and crystal lattice 
neighbors (Table 2). 



4. Discussion 

The identification of Co^^ tightly bound to the CAP AR2 region 
confirms the hypothesis originally put forward by Wickstrum & Egan 
(2002) that CAP can bind metals and, in addition, that residue His 19 
is involved in the metal-binding activity. Their hypothesis was based 
on the observation that CAP can effectively bind to Ni^^-charged 
chromatography media, with wild-type CAP eluting at a substantially 
higher concentration of imidazole than the CAP mutant Hisl9Ala. 

The three N-terminal domain Co^^-binding residues (Hisl9, His21 
and Glu96) constitute a major subset of the four residues previously 
identified to form CAP activating region 2 (AR2), which participates 
in transcription activation specifically at class II CAP-dependent 
promoters (Niu et al, 1996). In vivo, Hisl9Ala, His21Ala and 
LyslOlAla mutations each result in an approximately fivefold 
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Table 2 

cAMP-CAP dimer sulfate environments. 



Crystal asymmetric unit environment 


Crystal lattice 
neighbors 


CAP region 


Mic91 A T\7c99 4 M 
nisziyi i>i , i_^yszzyi IN 


T \7c^^ R xr^ 


AR2 


HisllB N^^, Lys225 N 


T v^^S/t 


AR2 


Ile20A N 




AR2 


Asp685 Glnll95 0'\ Argl225 N^N''^ 




cAMP site 


Argl235 N',N"^ 


His315 Trp85B N^^ 




HisSlA Trp85A N^^ 


cAMP site 


GluSlA O'^ 


Tlir285 N 


cAMP site 


Ala91A N 






Vall395 N, Argl425 N"^ 




DNA PO4 


Argl69A N, GlnlVOA N 




DNA PO4 


Tlirl685 Argl695 N, Glnl70B N 




DNA PO4 


Lys575 N^, Ser 1795 N,0'', Tlirl825 O'^^ 




DNA PO4 


Argl805 N,N'^\N"^ 




DNA binding 


Lys2015 N 


Ty 206A 


DNA PO4 



reduction in class II transcription activation, while Glu96Ala muta- 
tion results in an approximately threefold improvement; these same 
mutations have no effect on class I transcription activation (Niu et al. , 
1996). At class II promoters, CAP binds to a site '^—45 bp upstream 
of the transcription start site, and CAP-AR2 is predicted based on 
cross-linking analysis to contact a highly acidic loop in the N-terminal 
domain of the RNAP subunit with sequence EEDE (residues 162- 
165; Niu et al., 1996; Lawson et al., 2004). 

In the crystal structure reported here, Co^^-metal-binding coordi- 
nation by the three AR2 residues is completed through an acidic 
residue provided in 'trans' through a crystal contact. Although no 
specific metal requirement has been reported for CAP class II CAP- 
dependent transcription activation, our structure raises the possibility 
that the local environment of AR2 in contact with the acidic loop of 
the RNAP subunit could involve or accommodate metal binding. 

Several sulfates in the crystal structure adopt positions that 
correspond to phosphate sites of the DNA backbone in CAP-DNA 
crystals (Table 2, Fig. Ic). The DNA in the CAP dimer-DNA 
complexes is strongly bent and electrostatic interactions are thought 
to play an important role in DNA bending (Schultz et al., 1991; 
Gartenberg & Crothers, 1988; Kapanidis et al., 2001); our structure 
suggests that the CAP C-terminal domains have an intrinsic 
capability to bind to DNA half-sites prior to adoption by CAP of its 
fully bent DNA binding conformation. 

We thank Samuel Lara-Gonzalez and Brian P. Hudson for assis- 
tance with material preparation and data collection. The National 
Synchrotron Light Source is supported by the US Department of 
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